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Abstract 

The paper investigates the question of whether or not determinations of the distance between the Sun 
and the Galactic center Ro are affected by the so-called "bandwagon effect", leading to selection effects in 
published data that tend to be close to expected values, as was suggested by some authors. It is difficult 
to estimate numerically a systematic uncertainty in Ro due to the bandwagon effect; however, it is highly 
probable that, even if widely accepted values differ appreciably from the true value, the published results 
should eventually approach the true value despite the bandwagon effect. This should be manifest as a 
trend in the published Ro data: if this trend is statistically significant, the presence of the bandwagon 
effect can be suspected in the data. Fifty two determinations of Ro published over the last 20 years were 
analyzed. These data reveal no statistically significant trend, suggesting they are unlikely to involve 
any systematic uncertainty due to the bandwagon effect. At the same time, the published data show a 
gradual and statistically significant decrease in the uncertainties in the Ro determinations with time. 

1 Introduction 

Accurate knowledge of the distance between the Sun and the Galactic center Rq is very important for many 
problems associated with the structure and evolution of the Universe, as well as for astrometric applications. 
For example, the need to determine the effect of Galactic aberration on the proper motions of radio sources 
more precisely motivated our study of the accuracy of i?o determinations [1] . 

Like any quantity derived from observations, determinations of Rq have both random and systematic 
uncertainties. The systematic uncertainties are most dangerous in terms of using the corresponding results 
in various applications. Such systematic uncertainties could be due to instrumental and methodical un- 
certainties, or uncertainties in a theory used to interpret the measured values. Alongside these, subjective 
uncertainties that are not associated with the accuracy of the observations or data processing can arise. One 
example is the "bandwagon effect". This refers to the possibility that only measured results corresponding 
fairly well to the expected values are published; i.e., these are results that differ from commonly accepted 
expectations (or values published earlier) by a "reasonable" (usually small) amount. However, the fact that 
the overwhelming majority of published data were honestly derived from observations suggests that, even if 
they are filtered by publication selection, published values should gradually approach the true value. There- 
fore, the presence of a trend in the published determinations of some quantity may indicate a role played by 
the bandwagon effect (naturally, taking into account possible real variations of this quantity with time). 

In our case, a role of the bandwagon effect has been suspected in a number of reviews devoted to 
Ro determinations (e.g., [2-4]), which can be significantly subject to this systematic uncertainty. These 
suspicions were based on claims of systematic variation of the published Rq values with time. However, the 
trends obtained in [2-4] do not agree with each other (this is discussed in Section 3 in more detail). Moreover, 
many results refer to out-of-date publications that are almost irrelevant now. The current study is aimed 
at elucidating whether or not published values of Rq obtained over the past 20 years display appreciable 
time-dependent variations. 
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2 Determinations of Rq used in the analysis 



We used Rq determinations published between 1992 and 2011. Those interested in earlier data can refer to 
the reviews [2,3,5,6]. We used all published data except those refined later. Thus, we did not use the result 
of [7], revised in [8], the result of [9], revised in [10], and the result of [11], revised in [12] and later in [13]. 
In cases when the estimates of both random (e s t a t) and systematic (e sys t) uncertainties are available, 

we calculated the total uncertainty as the square root of the sum of their squares, e = \J e 2 stat + e 2 syst . The 

corresponding data are presented in Table [1] 

Table 1: Values of Rq for which random and systematic uncertainties are available 



Reference Rq, kpc 



[14] 


R{) 


= 7.52 ±0.10 


Ut ± 0.35 


\syst 


[15] 


Ro 


= 7.94 ±0.37 


Ut± 0.26 


\syst 


[16] 


Ro 


= 8.07 ±0.32 


Ut ±0.13 


\syst 


[13] 


Ro 


= 8.33 ±0.17 


Ut± 0.31 


\syst 


[17] 


Ro 


= 8.28 ±0.15 


Ut±0.29 


\syst 


[18] 


Ro 


= 8.24 ±0.08 


Ut ± 0-42 


\syst 


[19] 


Ro 


= 8.3 ±0.46 


Ut± i.o 


\syst 



When two versions of the Rq determinations are available, we averaged these values. These results are 
given in Table [2] 

Table 2: Mean values of Rq 



Reference Rq, kpc 



[20] 


8.7 ±0.7 


8.9 ±0.7 


[21] 


7.6 ±0.4 


8.3 ±0.5 


[22] 


7.9 ±0.85 


8.2 ±0.9 


[23] 


8.6 ±0.7 


8.8 ±0.4 


[24] 


7.95 ±0.62 


8.25 ±0.79 


[25] 


7.96 ±0.63 


8.36 ±0.37 


[26] 


7.7 ±0.7 


7.8 ±0.6 



Asymmetric confidence intervals, rather than rms uncertainties, are provided as estimates of the accuracy 
in some studies [25,27,28]. In these cases, the uncertainties are taken to be the mean values of the lower and 
upper bounds of the intervals. Since these bounds are close to each other in all cases, this substitution does 
not appreciably affect our results. 

The final list of all the Rq values used is presented in Table |31 

3 Data analysis 

The existence of a trend in the published Rq values has been suggested in various earlier studies [2-4,6,62]. 
A systematic time-dependent decrease in published Rq values during 1970-1990 was found in [2,3,6,62]. 
Although these data are mainly of historical interest now, this first suggestion of the presence of a bandwagon 
effect in Rq determinations seems to have been based on a time-dependent drift in the data [62]. At the 
same time, the data for 1990-1998 do not show any significant trend [6], while the data of [3] show a slight 
positive trend, i.e., a small increase in Rq values with time during 1990-2003. A large positive trend was 
found in [4] using results obtained in 1992-2010. 

These results are somewhat contradictory, although we must bear in mind that they sometimes refer 
to different observing intervals. The conclusions of the papers cited above are based on various samples of 
published results, so that they could be distorted by selection effects in the data used. 

To resolve this contradiction, we analyzed all available determinations of Rq indicated in the previous 
Section (see Fig. [I]). The epoch for a given R value, taken to refer to the date of publication, is plotted along 
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Table 3: Results of Rq determinations 



Rq, kpc 


STD 


Reference 


Rq, kpc 


STD 


Reference 


7.9 


0.8 


[291 

L 


8.05 


0.6 


[22] 

L J 


8.1 


1.1 


[301 


8.3 


0.3 


[31] 

L J 


7.6 


0.6 


[321 

L J 


7.7 


0.15 


[33] 

L J 


7.G 


0.4 


[341 

L J 


8.01 


0.44 


[35] 

L J 


8.09 


0.3 


[361 

L J 


8.7 


0.6 


[23] 

L J 


7.5 


1.0 


[371 


7.2 


0.3 


[38] 


7.0 


0.5 


[391 


7.52 


0.36 


[14] 

L J 


8.8 


0.5 


[201 

L W J 


8.1 


0.7 


[24] 

L J 


7.1 


0.5 


[401 

L J 


7.4 


0.3 


[41] 

L J 


8.3 


1.0 


[421 

L J 


7.94 


0.45 


[15] 

L J 


8.21 


0.98 


[431 

L "J 


8.07 


0.35 


[16] 

L J 


7.95 


0.4 


[211 

L J 


8.16 


0.5 


[25] 

L J 


7.55 


0.7 


[441 

L J 


8.33 


0.35 


[131 

L J 


8.1 


0.4 


[451 

L 


8.7 


0.5 


[28] 

L ^ J 


8.5 


0.5 


[461 

L J 


7.58 


0.40 


[471 

L ' J 


7.66 


0.54 


[481 

L W J 


7.2 


0.3 


[49] 

L J 


8.1 


0.15 


[501 


8.4 


0.6 


[51] 

L J 


7.1 


0.4 


[521 

L J 


7.75 


0.5 


[26] 

L ■ J 


8.51 


0.29 


[53] 


7.9 


0.75 


[27] 


8.2 


0.21 


[54] 


8.24 


0.43 


[18] 


8.6 


1.0 


[6] 


8.28 


0.33 


[17] 


7.4 


0.3 


[8] 


7.7 


0.4 


[55] 


7.9 


0.3 


[56] 


8.1 


0.6 


[57] 


8.67 


0.4 


[10] 


8.3 


1.1 


[19] 


8.2 


0.7 


[58] 


7.80 


0.26 


[59] 


8.24 


0.42 


[60] 


8.3 


0.23 


[61] 



the horizontal axis in Fig. Q] This epoch was determined by the month of publication in the corresponding 
journal issue number, or taken from the ADS bibliographic database (usually, for conference proceedings). 
If it was not possible to determine the month of publication, we adopted the middle of the year as the date 
of publication. When the epochs for two results coincided, we shifted them both by 0.02-0.03 yr in opposite 
directions to separate the corresponding points in the graph. 

We calculated two versions of a linear trend using the given data. In the first version, the weights for 
the original data were taken to be equal to the inverse squares of their uncertainties. The resulting trend is 
—0.0045 ± 0.0103 kpc/yr. We did not take the weights into consideration in the second version, resulting in 
the trend +0.0075 ± 0.0100 kpc/yr. 

A simple, but effective criterion can be used to detect trends or other low-frequency variations in the 
data, namely Abbe's criterion. This criterion is aimed at testing the hypothesis that all the mathematical 
expectancies of the analyzed measurements xi are equal. To test this hypothesis, we calculated the Abbe 
statistic, which is essentially the ratio of the Allan variance AV (whose applications to astronomical studies 
are described in detail in [63,64]) to the dispersion of the data D: 

AV 

117=1 (yj - y»+i) 2 m 

2(n-l) V ' 

17=1 ( x i ~ 
n-l 

where x is the mean of all the measurements x^. 

If there are appreciable low-frequency variations in the data, including trends, D will be appreciably 



Q = 
AV = 
D = 
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Figure 1: Values of i?o used in this study. The solid and dashed lines correspond to the weighted and 
unweighted versions of the calculated trends, respectively. 

greater than the Allan variance. Thus, if q is lower than the corresponding critical point of the Abbe distri- 
bution, the hypothesis that there is no trend must be rejected, and we conclude that there are statistically 
significant, systematic variations in the data. Our value is much greater than the 1% quantilc for the Abbe 
distribution, which is equal to 0.69 

Our conclusion based on all the calculations suggests that there are no statistically significant trends in 
the i?o determinations for the last 20 years. 

It is also interesting to consider how the accuracy of the Rq determinations changes with time. This 
analysis was carried out using the data shown in Fig. [5J yielding the trend —0.0103 ± 0.0053 kpc/yr. Thus, 
there is a statistically significant decrease in the uncertainty of the Rq determinations with time. 

This conclusion seems all the more interesting because researchers have recently begun to pay more 
attention to estimate of the uncertainties of their results, including analysis of both statistical (formal) 
and systematic uncertainties. On the one hand, the uncertainties in Rq should decrease with time as the 
observational data are accumulated and the observational and data-processing techniques are improved; on 
the other hand, these uncertainties could increase as new, more correct methods are used to estimate the 
accuracy. The former tendency has apparently been stronger in recent years. 

4 Conclusions 

Our analysis of 52 Rq determinations published in 1992-2011 has shown that these data do not display a 
statistically significant trend, i.e., a systematic increase or decrease in the Rq values with time. Therefore, we 
do not confirm the conclusions of Foster and Cooper [4], who found a large positive trend, i.e., a systematic 
increase in Rq values, over the last 20 years. At the same time, we confirm the results of [3,6] indicating a 
slight trend, although on a shorter time interval. 

The obvious origin of the discrepancy between our results and those of [4] is selection effects associated 
with the original data used in the analysis. This is clearly visible in Fig. |3l where our data and the data of 
[4] are shown. For some reason, the latter data did not include several Rq values above the mean during the 
first half of the interval, as well as several values below the mean value during the second half of the interval, 
yielding an appreciable increase in the published Rq values [4] . 
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Figure 2: Uncertainties of the i?o determinations, kpc. 
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Figure 3: Sets of i? values used here and in [4] (the latter ones are denoted by squares), kpc. 



In our opinion, our main conclusion that there is no statistically significant trend in the published Rq 
values over the last 20 years suggests that these data do not contain a statistically significant bandwagon 
effect. 

At the same time, we have discovered an appreciable decrease in the uncertainties of the Ro determinations 
with time. 
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